NSF PAR Search | NSF Public Access Repository


Optimizing Speaker Diarization for the Classroom: Applications in Timing Student Speech and Distinguishing Teachers from Children

https://doi.org/10.5281/zenodo.14871875

Wang, Jiani; Dudy, Shiran; He, Xinlu; Wang, Zhiyong; Southwell, Rosy; Whitehill, Jacob (January 2025, Journal of Educational Data Mining)

An important dimension of classroom group dynamics and collaboration is how much each person contributes to the discussion. With the goal of distinguishing teachers' speech from children's speech and measuring how much each student speaks, we investigated how automatic speaker diarization can be built to handle real-world classroom group discussions. We examined key design considerations such as the level of granularity of speaker assignment, speech enhancement techniques, voice activity detection, and embedding assignment methods to find an effective configuration. The best speaker diarization system we found was based on the ECAPA-TDNN speaker embedding model and used Whisper automatic speech recognition to identify speech segments. The diarization error rate (DER) on challenging, noisy, spontaneous classroom data was around 34%, and the correlation between estimated and human-annotated amounts of speech per student reached 0.62. The accuracy of distinguishing teachers' speech from children's speech was 69.17%. We evaluated the system for potential accuracy bias across people of different skin tones and genders and found no statistically significant differences in accuracy across either dimension. Thus, the presented diarization system has the potential to benefit educational research and to provide teachers and students with useful feedback to better understand their classroom dynamics.
Full Text Available
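As an illustration of the two measures named in the diarization abstract above, the following minimal sketch computes a diarization error rate from its standard components (missed speech, false alarm, speaker confusion) and sums speaking time per speaker. The function names, the segment format `(speaker, start, end)`, and all numbers are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def diarization_error_rate(missed, false_alarm, confusion, total_speech):
    """Standard DER: (missed + false alarm + confusion) / total reference speech.

    All arguments are durations in seconds.
    """
    if total_speech <= 0:
        raise ValueError("total_speech must be positive")
    return (missed + false_alarm + confusion) / total_speech


def speaking_time(segments):
    """Sum the duration (seconds) attributed to each speaker label.

    `segments` is an iterable of (speaker, start, end) tuples; this
    format is an assumption made for illustration.
    """
    totals = defaultdict(float)
    for speaker, start, end in segments:
        totals[speaker] += end - start
    return dict(totals)


# Hypothetical durations chosen only to illustrate the arithmetic.
der = diarization_error_rate(
    missed=10.0, false_alarm=8.0, confusion=16.0, total_speech=100.0
)

# Hypothetical diarization output for two students.
segments = [("s1", 0.0, 4.0), ("s2", 4.0, 6.5), ("s1", 7.0, 9.0)]
totals = speaking_time(segments)
```

Per-speaker totals of this kind are what would be compared against human annotations when computing a correlation such as the one reported in the abstract.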
Automated Evaluation of Classroom Instructional Support with LLMs and BoWs: Connecting Global Predictions to Specific Feedback

Whitehill, Jacob; LoCasale-Crouch, Jennifer (June 2024, Journal of Educational Data Mining)

With the aim of providing teachers with more specific, frequent, and actionable feedback about their teaching, we explore how Large Language Models (LLMs) can be used to estimate "Instructional Support" domain scores of the CLassroom Assessment Scoring System (CLASS), a widely used observation protocol. We design a machine learning architecture that uses zero-shot prompting of Meta's Llama2 and/or a classic Bag of Words (BoW) model to classify individual utterances of teachers' speech (transcribed automatically using OpenAI's Whisper) for the presence of Instructional Support. Then, these utterance-level judgments are aggregated over a 15-min observation session to estimate a global CLASS score. Experiments on two CLASS-coded datasets of toddler and pre-kindergarten classrooms indicate that (1) automatic CLASS Instructional Support estimation accuracy using the proposed method (Pearson R up to 0.48) approaches human inter-rater reliability (up to R=0.55); (2) LLMs generally yield slightly greater accuracy than BoW for this task, though the best models often combined features extracted from both LLM and BoW; and (3) for classifying individual utterances, there is still room for improvement of automated methods compared to human-level judgments. Finally, (4) we illustrate how the model's outputs can be visualized at the utterance level to provide teachers with explainable feedback on which utterances were most positively or negatively correlated with specific CLASS dimensions.
Full Text Available
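The abstract above describes aggregating utterance-level judgments into a session-level estimate and evaluating it with Pearson correlation. A minimal sketch of that pipeline shape follows. The mean is used as the aggregator purely as an assumption, since the abstract does not state the aggregation function, and `session_score` and `pearson_r` are hypothetical names.

```python
import math


def session_score(utterance_probs):
    """Aggregate per-utterance probabilities into one session-level value.

    Uses a plain mean, which is an illustrative assumption rather than
    the paper's aggregation method.
    """
    if not utterance_probs:
        raise ValueError("need at least one utterance")
    return sum(utterance_probs) / len(utterance_probs)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical utterance-level outputs for three sessions.
sessions = [[0.2, 0.4, 0.9], [0.1, 0.1, 0.4], [0.7, 0.9, 0.8]]
predicted = [session_score(s) for s in sessions]

# Hypothetical human-coded session scores for the same three sessions.
human = [4.0, 2.5, 6.0]
r = pearson_r(predicted, human)
```

In practice the session-level estimates would be compared against human-coded CLASS scores across many sessions. Three sessions are shown here only to keep the example short.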
Tracking Classroom Movement Patterns with Person Re-ID

https://doi.org/10.5281/ZENODO.12729916

He, Xinlu; Wang, Jiani; Trinh, Viet Anh; McReynolds, Andrew; Whitehill, Jacob (January 2024, International Educational Data Mining Society)
Paaßen, Benjamin; Demmans Epp, Carrie (Eds.)
With the goal of supporting real-time AI-based agents to facilitate student collaboration, as well as to enable educational data-mining of group discussions, multimodal classroom analytics, and social network analysis, we investigate how to identify who-is-where-when in classroom videos. We take a person re-identification (re-id) approach, and we explore different methods of improving re-id accuracy in the challenging environments of school classrooms. Our results on a multi-grade classroom (MGC) dataset suggest that (1) fine-tuning off-the-shelf person re-id models such as AGW can deliver sizable accuracy gains (from 70.4% to 76.7% accuracy); (2) clustering, rather than nearest-neighbor identification, can yield accuracy improvements (76.7% to 79.4%) of identifying each detected person, especially when structural constraints are imposed; and (3) there is a strong benefit to re-id accuracy in obtaining multiple enrollment images from each student.
Full Text Available
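To make the nearest-neighbor baseline and the role of multiple enrollment images in the re-id abstract above more concrete, here is a minimal sketch. It assumes each person image has already been mapped to an embedding vector by some re-id model, averages each student's enrollment embeddings into a prototype, and assigns a query to the most similar prototype by cosine similarity. The names, the 2-D vectors, and the averaging step are illustrative assumptions, not the paper's method.

```python
import math


def cosine(a, b):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def identify(query, enrollment):
    """Return the enrolled identity whose prototype is closest to `query`.

    `enrollment` maps an identity to a list of embeddings, one per
    enrollment image; the prototype is their mean (an assumption).
    """
    prototypes = {name: mean_vector(vecs) for name, vecs in enrollment.items()}
    return max(prototypes, key=lambda name: cosine(query, prototypes[name]))


# Hypothetical 2-D embeddings; real re-id embeddings are much larger.
enrollment = {
    "student_a": [[1.0, 0.0], [0.9, 0.1]],
    "student_b": [[0.0, 1.0], [0.2, 0.8]],
}
match = identify([0.8, 0.3], enrollment)
```

The clustering variant mentioned in the abstract would instead group all detections jointly before assigning identities; that step is not sketched here because the abstract does not specify its constraints.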
Automatic Speech Recognition Tuned for Child Speech in the Classroom

https://doi.org/10.1109/ICASSP48485.2024.10447428

Southwell, Rosy; Ward, Wayne; Trinh, Viet Anh; Clevenger, Charis; Clevenger, Clay; Watts, Emily; Reitman, Jason; D’Mello, Sidney; Whitehill, Jacob (April 2024, IEEE)

Full Text Available
Compositional clustering: Applications to multi-label object recognition and speaker identification

https://doi.org/10.1016/j.patcog.2023.109829

Li, Zeqian; He, Xinlu; Whitehill, Jacob (July 2023, Pattern Recognition)

Full Text Available
How to Give Imperfect Automated Guidance to Learners: A Case-Study in Workplace Learning

https://doi.org/10.1007/978-3-031-11644-5_1

Whitehill, Jacob; Erfanian, Amitai (July 2022, Springer International Publishing)

Full Text Available
How to Give Imperfect Automated Guidance to Learners: A Case-Study in Workplace Learning

https://doi.org/10.1007/978-3-031-11644-5_1

Whitehill, Jacob; Erfanian, Amitai (July 2022, International Conference on Artificial Intelligence in Education)

Full Text Available
Can the Mathematical Correctness of Object Configurations Affect the Accuracy of Their Perception?

https://doi.org/10.1109/CVPRW56347.2022.00522

Jiang, Han; Li, Zeqian; Whitehill, Jacob (June 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW))

Full Text Available
Can the Mathematical Correctness of Object Configurations Affect the Accuracy of Their Perception?

https://doi.org/10.1109/CVPRW56347.2022.00522

Jiang, Han; Li, Zeqian; Whitehill, Jacob (June 2022, Computer Vision Foundation)

Full Text Available
From learning optimization to learner flourishing: Reimagining AI in Education at the Institute for Student‐AI Teaming (iSAT)

https://doi.org/10.1002/aaai.12158

D'Mello, Sidney K.; Biddy, Quentin; Breideband, Thomas; Bush, Jeffrey; Chang, Michael; Cortez, Arturo; Flanigan, Jeffrey; Foltz, Peter W.; Gorman, Jamie C.; Hirshfield, Leanne; et al. (March 2024, AI Magazine)

The Institute for Student‐AI Teaming (iSAT) addresses the foundational question: how to promote deep conceptual learning via rich socio‐collaborative learning experiences for all students? This question is ripe for AI‐based facilitation and has the potential to transform classrooms. We advance research in speech, computer vision, human‐agent teaming, computer‐supported collaborative learning, expansive co‐design, and the science of broadening participation to design and study next-generation AI technologies (called AI Partners) embedded in student collaborative learning teams in coordination with teachers. Our institute subscribes to theoretical perspectives that aim to create a normative environment of widespread engagement through responsible design of technology, curriculum, and pedagogy in partnership with K–12 educators, racially diverse students, parents, and other community members.
Full Text Available
